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41 Information retievalk session 9: language models: A unified model for metasearch, loo 
12 pooling, and system evaluation 

Javed A. Aslann , Virgiliu Pavlu , Robert Save!! 

Proceedings of the twelfth international conference on Information and knowledge 

management November 2003 

We present a unified model which, given the ranked iists of documents returned by muitiple 
retrieval systems in response to a given query, simultaneously solves the problems of (1) fusing the 
ranked lists of documents in order to obtain a high-quality combined list (metasearch); (2) 
generating document collections likely to contain large fractions of relevant documents (pooling); 
and (3) accurately evaluating the underlying retrieval systems with small numbers of relevance 
judgments (efficient s ... 



42 Information retrieval session 8: efficiency: Efficient query evaluation using a two- loo 
level retrieval process 

Andrei Z. Broder , David Carmel , Michael Herscovici , Aya Soffer , Jason Zien 
Proceedings of the twelfth international conference on Information and knowledge 
management November 2003 

We present an efficient query evaluation method based on a two level approach: at the first level, 
our method iterates in parallel over query term postings and identifies candidate documents using 
an approximate evaluation taking into account only partial information on term occurrences and no 
query independent factors; at the second level, promising candidates are fully evaluated and their 
exact scores are computed. The efficiency of the evaluation process can be improved signific ... 



43 Research track: Weighted Association Rule Mining using weighted support and lOO 
12 significance framework 

Feng Tao , Fionn l^urtagh , Mohsen Farid 

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and 

data mining August 2003 

We address the issues of discovering significant binary relationships in transaction datasets in a 
weighted setting. Traditional model of association rule mining is adapted to handle weighted 
association rule mining problems where each item is allowed to have a weight. The goal is to steer 
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the mining focus to those^^nificant relationships involving items ^n significant weights rather 
than being flooded in the combinatornal explosion of insignificant relationships. We identify the 
challenge o ... 



44 Research track: Accurate decision trees for mining high-speed data streams loo 

Joao Gama , Ricardo Rocha , Pedro Medas 
— Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and 

data mining August 2003 

In this paper we study the problenn of constructing accurate decision tree models from data 
streams. Data streams are incremental tasks that require incremental, online, and any-time learning 
algorithms. One of the most successful algorithms for mining data streams is VFDT. In this paper we 
extend the VFDT system in two directions: the ability to deal with continuous data and the use of 
more powerful classification techniques at tree leaves. The proposed system, VFDTc, can 
incorporate and classif ... 



45 Research track: Understanding captions in biomedical publications lOO 

William W. Cohen , Richard Wang , Robert F. Murphy 
— Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and 

data mining August 2003 

From the standpoint of the automated extraction of scientific knowledge, an important but little- 
studied part of scientific publications are the figures and accompanying captions. Captions are dense 
in information, but also contain many extra-grammatical constructs, making them awkward to 
process with standard information extraction methods. We propose a scheme for "understanding" 
captions in biomedical publications by extracting and classifying "image pointers" (references to the 
accompanying i ... 



46 Industrial/governnnent track: The anatomy of a multimodal information filter loo 

Yi-Leh Wu , King-Shy Goh , Beitao Li , Huaxing You , Edward Y. Chang 
— Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and 

data mining August 2003 

The proliferation of objectionable information on the Internet has reached a level of serious concern. 
To empower end-users with the choice of blocking undesirable and offensive websites, we propose a 
multimodal information filter, named MORF. In this paper, we present MORF's core components: its 
confidence-based classifier, a Cross-bagging ensemble scheme, and multimodal classification 
algorithm. Empirical studies and initial statistics collected from the MO/?F filters depio ... 



47 Industrial/govemment track: Frequent-subsequence-based prediction of outer loo 
12 membrane proteins 

Rong She , Fei Chen , Ke Wang , Martin Ester , Jennifer L. Gardy , Fiona S. L. Brinkman 
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and 
data mining August 2003 

A number of medically important disease-causing bacteria (collectively called Gram-negative 
bacteria) are noted for the extra "outer" membrane that surrounds their cell. Proteins resident in 
this membrane (outer membrane proteins, or OMRs) are of primary research interest for antibiotic 
and vaccine drug design as they are on the surface of the bacteria and so are the most accessible 
targets to develop new drugs against. With the development of genome sequencing technology and 
bioinformatics, bio ... 



48 Tools for integrating and querying web information: OntoKlioj: a semantic web lOO 
12 portal for ontology searching, ranking and classification 

Chintan Patel , Kaustubh Supekar , Yugyung Lee , E. K. Park 

Proceedings of the fifth ACM international workshop on Web information and data 

management November 2003 

The goal of the next generation Web is to build virtual communities, wherein software agents and 
people can work in cooperation by sharing knowledge. To achieve this goal, the emerging Semantic 
Wet) community has proposed ontologies to express knowledge in a machine understandable way. 
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The process of building ancrrnaintaining ontologies, winich is known^s Ontology Engineering, 
presents unique challenges. These challenges are related to lack of trustworthy and authoritative 
knowledge sources and ab ... 

49 Extension of spatial metadata and agent-based spatial Data navigation mechanism lOO 

Yingwei Luo , Xiaolin Wang , Zhuoqun Xu 

— Proceedings of the eleventh ACM international symposium on Advances in geographic 
information systems Novennber 2003 

Fast navigation to distributed spatial data has been the keystone of distributed GIS. In this paper, a 
hierarchical spatial nnetadata Database frannework is presented based on the existing spatial 
nnetadata standard. This frannework can efficiently organize the distributed spatial data in network. 
With the support of the hierarchical spatial nnetadata Databases, a user-oriented descriptive 
specification for spatial data requirement is proposed. A map (map-layer) is described as a tuple 
consisting of ... 

50 Maintenance and workload: Maintenance policy selection in heterogeneous data lOO 
12 warehouse environments: a heuristics-based approach 

H. Engstro , S. Chakravarthy , B. Lings 

Proceedings of tiie 6th ACM international workshop on Data warehousing and OLAP November 
2003 

This work addresses data warehouse maintenance, i.e. how changes to autonomous, 
heterogeneous, and distributed sources should be detected and propagated to a warehouse. The 
research community has mainly addressed issues relating to the internal operation of data 
warehouse servers. Work related to data warehouse maintenance has received less attention and 
only a limited set of maintenance alternatives are considered while ignoring the autonomy and 
heterogeneity of sources. In this paper, we extend ... 

51 Infornnation security management: a new paradigm lOO 

Jan Eloff , Mariki Eloff 

— Proceedings of the 2003 annual research conference of the South African institute of 
computer scientists and information technologists on Enablement through technology 

September 2003 

Information security management needs a paradigm shift in order to successfully protect 
information assets. Organisations must change to the holistic management of information security, 
requiring a well-established Information Security Management System (ISMS). An ISMS addresses 
all aspects in an organisation that deals with creating and maintaining a secure information 
environment. Organisational management and their staff to manage information security cost- 
effectively can use the ISMS. It can ... 



52 An investigation of l<nowledge management implementation strategies loo 

Nakkiran N. Sunassee , David A. Sewry 
— Proceedings of the 2003 annual research conference of the South African institute of 
computer scientists and information technologists on Enablement through technology 

September 2003 

This paper details research conducted in the area of knowledge management implementation 
strategies. An investigation of the literature reveals that when organisations initiate a knowledge 
management effort, most of them tend to over-emphasise the role of information technology at the 
expense of the human factor. A preliminary survey of local organisations using Duffy's Knowledge 
Management Benchmarking Questionnaire confirms these findings, and a framework which will 
address the shortcomings in ... 



53 An adaptation of dataflow methods for WYSIWYG document processing 

Donald D. Chamberlin 

— Proceedings of the ACM conference on Document processing systems January 2000 



100 



http://port Ipv cm org/results cfm query 8 8cl ssify or cl ssifie or 1/8/ 



Results ^ ^ Page 4 o 

54 Network simulation: LoSaliead revisited in wireless network simulations lOO 

pft Jason Liu , David M. Nicol 

— Proceedings of the sixteenth workshop on Parallel and distributed simulation May 2002 
Rapid growth in wireless communication systems motivates the development of technology 
supporting the simulation of large-scale wireless systems. However, it is widely recognized that 
wireless communications do not have substantial "lookahead" needed by conservative 
synchronization protocols. This paper focuses on identifying and exploiting lookahead for such 
models. We find lookahead in three ways, exploiting characteristics of low power networks, the 
transceiver logic, and the way in which pro ... 



55 Advertising and Security for E-Commerence: Protecting electronic commerce from lOO 
12 distributed denial-of-service attacks 

Jose Brustoloni 

Proceedings of the eleventh international conference on World Wide Web May 2002 
It is widely recognized that distributed denial-of-service (DDoS) attacks can disrupt electronic 
commerce and cause large revenue losses. However, effective defenses continue to be mostly 
unavailable. We describe and evaluate VIPnet, a novel value-added network service for protecting e- 
commerce and other transaction-based sites from DDoS attacks. In VIPnet, e-merchants pay 
Internet Service Providers (ISPs) to carry the packets of the e-merchants' best clients (called VIPs) 
in a privileged class ... 



56 Extraction and Visualization: A flexible learning system for wrapping tables and lOO 

12 lists in HTML documents 

William W. Cohen , Matthew Hurst , Lee S. Jensen 

Proceedings of the eleventh international conference on World Wide Web May 2002 
A program that makes an existing website look like a database is called a wrapper. Wrapper 
learning is the problem of learning website wrappers from examples. We present a wrapper-learning 
system called WL^ that can exploit several different representations of a document. Examples of 
such different representations include DOM-level and token-level representations, as well as two- 
dimensional geometric views of the rendered page (for tabular data) and representations of th ... 



57 FLAME: Formal Linear Algebra Methods Environment lOO 
John A. Gunnels , Fred G. Gustavson , Greg M. Henry , Robert A. van de Geijn 
ACM Transactions on Mathematical Software (TOMS) December 2001 
Volume 27 Issue 4 

Since the advent of high-performance distributed-memory parallel computing, the need for 
intelligible code has become ever greater. The development and maintenance of libraries for these 
architectures is simply too complex to be amenable to conventional approaches to implementation. 
Attempts to employ traditional methodology have led, in our opinion, to the production of an 
abundance of anfractuous code that is difficult to maintain and almost impossible to upgrade. Having 
struggled with these is ... 



58 Learning visual concepts lOO 

Lft Jean M. Buijs , Michael S. Lew 

— Proceedings of the seventh ACM international conference on Multimedia (Part 2) October 1999 



59 DAOP-ADL: an architecture description language for dynannic connponent and lOO 
aspect-based development 

Monica Pinto , Lidia Fuentes , Jose Maria Troya 

Proceedings of the second international conference on Generative programming and 

component engineering September 2003 

Architecture description languages deal with the description, analysis and reuse of software 
architectures. This paper describes DAOP-ADL, a component- and aspect-based language to specify 
the architecture of an application in terms of components, aspects and a set of plug-compatibility 
rules between them. With the aim of connecting the specification of the application architecture to 
the implementation, we describe our language using XML and XML Schemas. The DAOP-ADL 
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60 Refactoring for generalization using type constraints lOO 

1^ Frank Tip , Adam Kiezun , Dirk Baumer 

ACM SIGPLAN Notices , Proceedings of the 18th ACM SIGPLAN conference on Object-oriented 
programing, systems, languages, and applications October 2003 
Volunne 38 Issue 11 

Refactoring is the process of applying behavior-preserving transfornnations (called "refactorings") in 
order to inriprove a program's design. Associated with a refactoring is a set of preconditions that 
must be satisfied to guarantee that program behavior is preserved, and a set of source code 
modifications. An important category of refactorings is concerned with generalization (e.g., Extract 
Interface for re-routing the access to a class via a newly created interface, and Pull Up Memb ... 
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